Defining Antonymy: A Corpus-based Study of Opposites Found by Lexico-syntactic Patterns
Abstract
Using small sets of adjectival seed antonym pairs, we automatically find patterns in which these pairs co-occur in a large corpus of Dutch, and then use these patterns to extract new antonym pairs. Evaluation of the extracted pairs by five human judges showed that the automatic scores correlate with human evaluation and that pattern-based methods can be used to extract new antonym pairs. The majority of extracted pairs were noun-noun pairs, contrary to expectations based on previous research. Additionally, the method identifies a subgroup of co-hyponyms that frequently function antonymously and that, together with more traditional antonyms, make up a wider class of incompatibles. This suggests that antonymy is a diverse relation that includes pairs of different types and categories that are not captured by any single linguistic theory. Comparison with Dutch WordNet and an online Dutch dictionary shows that only a handful of extracted pairs are currently listed in these existing resources, emphasizing the usefulness of the project.
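The seed-and-pattern loop described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the English toy sentences, the function names `find_patterns` and `extract_pairs`, the pattern shape (one word of left context plus the words between the pair) and the count-based score are all assumptions made for illustration, whereas the study itself works on a large Dutch corpus.

```python
from collections import Counter

def find_patterns(sentences, seeds, max_gap=2):
    """Count contexts of the form '<left> X <middle...> Y' around seed pairs."""
    patterns = Counter()
    for sent in sentences:
        tokens = sent.lower().split()
        for a, b in seeds:
            for x, y in ((a, b), (b, a)):
                if x not in tokens or y not in tokens:
                    continue
                i, j = tokens.index(x), tokens.index(y)
                # Require one word of left context and 1..max_gap words in between.
                if i >= 1 and 1 <= j - i - 1 <= max_gap:
                    patterns[(tokens[i - 1], tuple(tokens[i + 1:j]))] += 1
    return patterns

def extract_pairs(sentences, patterns, seeds):
    """Apply the learned patterns to find word pairs that are not seeds."""
    known = {frozenset(pair) for pair in seeds}
    found = Counter()
    for sent in sentences:
        tokens = sent.lower().split()
        for (left, middle), weight in patterns.items():
            n = len(middle)
            for i in range(1, len(tokens) - n - 1):
                if tokens[i - 1] == left and tuple(tokens[i + 1:i + 1 + n]) == middle:
                    pair = frozenset((tokens[i], tokens[i + 1 + n]))
                    if len(pair) == 2 and pair not in known:
                        # Assumed score: sum of the matching patterns' seed counts.
                        found[pair] += weight
    return found

# Hypothetical toy corpus (English stand-in for the Dutch corpus).
sentences = [
    "the difference between rich and poor is growing",
    "it was neither hot nor cold today",
    "the difference between young and old is small",
    "it was neither cheap nor expensive there",
    "the gap between war and peace is thin",
]
seeds = [("rich", "poor"), ("hot", "cold")]

patterns = find_patterns(sentences, seeds)
found = extract_pairs(sentences, patterns, seeds)
for pair, score in found.most_common():
    print(sorted(pair), score)
```

On this toy input, adjectival seeds also pull in a noun-noun pair, which mirrors the abstract's observation in a very small way. A realistic version would need tokenization, part-of-speech information and a more careful scoring function.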
Similar resources
Corpus Approaches to Antonymy
The word “antonymy” was coined in 1867 by CJ Smith to describe word-pairs commonly known as “opposites” such as hot/cold, girl/boy and buy/sell. Some linguists (e.g. Lyons (1977) and Cruse (1986)) apply the term “antonymy” restrictively and would only identify the first of these three pairs as being truly antonymous. However, this seems counter-intuitive in many respects, and this paper will us...
Pattern-Based Distinction of Paradigmatic Relations for German Nouns, Verbs, Adjectives
This paper implements a simple vector space model relying on lexico-syntactic patterns to distinguish between the paradigmatic relations synonymy, antonymy and hypernymy. Our study is performed across word classes, and models the lexical relations between German nouns, verbs and adjectives. Applying nearest-centroid classification to the relation vectors, we achieve a precision of 59.80%, which...
Automatic Extraction of Turkish Hypernym-Hyponym Pairs From Large Corpus
In this paper, we propose a fully automatic system for acquisition of hypernym/hyponymy relations from large corpus in Turkish Language. The method relies on both lexico-syntactic pattern and semantic similarity. Once the model has extracted the seeds by using patterns, it applies similarity based expansion in order to increase recall. For the expansion, several scoring functions within a boots...
Automatic Extraction of Lexico-Syntactic Patterns for Detection of Negation and Speculation Scopes
Detecting the linguistic scope of negated and speculated information in text is an important Information Extraction task. This paper presents ScopeFinder, a linguistically motivated rule-based system for the detection of negation and speculation scopes. The system rule set consists of lexico-syntactic patterns automatically extracted from a corpus annotated with negation/speculation cues and th...
Using Web Data to Investigate Antonym Canonicity
In the literature, some researchers (e.g. Gross, Fischer and Miller 1989, Charles, Reed and Derryberry 1994) treat antonym pairs as either canonical (for example old/young, cold/hot and happy/sad) or non-canonical (aged/youthful, cool/hot, happy/miserable), while others assume or argue for a continuum between the two categories (e.g. Herrmann, Chaffin, Conti, Peters and Robbins 1979, Murphy 200...